An Approach for Generating an XML Data Warehouse Schema using Model Transformation Language
نویسندگان
چکیده
Traditionally, the multidimensional schema of the data warehouse is derived from data sources that are mainly the company’s internal data, well-known and structured, by identifying facts, dimensions and numeric measurements through a manual analysis of the operational schemas. With the proliferation of new platforms of communication in today’s information societies, there has been growing numbers of web-based applications such as online social networks that generate huge amounts of XML data on the web. Therefore, it is increasingly important to develop an appropriate warehousing approach for such ever-growing XML data sources. However, XML documents have a complex hierarchical structure. Moreover, designing and building DWs is tedious, timeconsuming, error-prone and expensive process. In this paper, we describe an approach for automatically generating and building the star schema for data warehouse from XML schema. This approach is extensively based on standards (UML, XML, QVT, and XSLT). First, we model the structure of XML (XML Schema) using the Unified Modeling Language, which is the standard language for object oriented analysis and design. Then, we provide an algorithm that automatically selects the multidimensional concepts. After that, a representation in an XML schema language as the XML data warehouse schema description is derived automatically from UML star schema. Furthermore, we choose Query/Views/ Transformation (QVT), which is also OMG standard transformation language for defining and formalizing transformations between models. Finally, a prototype tool is implemented for testing and evaluating our approach and its transformations. Subject Categories and Descriptors H.2.7 [Database Administration]: Data warehouse and repository; D.3.3 [Language Constructs and Features] Data types and structures General Terms : Data warehouse Design, XML, XML Schema
منابع مشابه
Towards Business Intelligence over Unified Structured and Unstructured Data Using XML
Traditional data warehousing has been very successful in helping business enterprises to make intelligent decisions through declarative analysis of large amount of structured data stored in a relational database. However, not all enterprise data naturally fit into a relational model. Within an enterprise, there are huge amount of unstructured data, such as document content, emails, spreadsheets...
متن کاملConversion of XML Schema to Data Warehouse Schema using Automatic Approach
eXtensible Markup Language (XML) is data exchange format for representation data in Web based system. XML is used by many organizations for e-commerce and internet based applications such as online shopping, digital library, and electronic devices and so on. XML data is not sufficient to analyze on the Web. So XML is required to systematically analyze by industrial organizations to enable enhan...
متن کاملConceptual and Systematic Design Approach for XML Document Warehouses
EXtensible Markup Language (XML) has emerged as the dominant standard in describing and exchanging data among heterogeneous data sources. The increasing presence of large volumes of data appearing creates the need to investigate XML Document Warehouses as a means of handling the data. In this paper our focus is twofold. First we utilise Object Oriented (OO) concepts to develop and propose a con...
متن کاملDevelopment of Secure XML Data Warehouses with QVT
Context: Data warehouses are systems which integrate heterogeneous sources to support the decision making process. Data from the Web is becoming increasingly more important as sources for these systems, which has motivated the extensive use of XML to facilitate data and metadata interchange among heterogeneous data sources from the Web and the data warehouse. However, the business information t...
متن کاملLightweight Transformation of Data Models from SQL Schemas to UML-ER
Data modeling is an essential part of the software development process, and together with application modeling forms the core of the model-driven approach to software engineering. While UML is considered the standard for application modeling, there is really no corresponding open standard for data modeling. In this paper, we propose an approach and a tool to help bridge the gap between applicat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JDIM
دوره 12 شماره
صفحات -
تاریخ انتشار 2014